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When modelling driven steady states of matter, it is common practice either to choose transi- 
tion rates arbitrarily, or to assume that the principle of detailed balance remains valid away from 
equilibrium. Neither of those practices is theoretically well founded. Hypothesising ergodicity con- 
strains the transition rates in driven steady states to respect relations analogous to, but different 
from the equihbrium principle of detailed balance. The constraints arise from demanding that the 
design of any model system contains no information extraneous to the microscopic laws of motion 
and the macroscopic observables. This prevents over-description of the non-equilibrium reservoir, 
and implies that not all stochastic equations of motion are equally valid. The resulting recipe for 
transition rates has many features in common with equilibrium statistical mechanics. 

PACS numbers: PACS: 05.20.-y; 05.70.Ln; 83.50.Ax 

I. INTRODUCTION 

I address the question, what is the appropriate stochastic equation of motion to use when modeUing a driven 
steady state (including chaotic and fluctuating steady states) such as that of a fluid under continuous shear flow? At 
equihbrium, the solution is well understood. To generate configurations consistent with the equihbrium ensemble, 
one may use any equation of motion that respects the principle of detailed balance, which is a constraint on ratios 
of forward and reverse transition rates. That condition ensures that every thermally-driven flux is balanced by 
an equal and opposite flux. For non-equilibrium systems in continuously driven steady states, no such guidance is 
hitherto available in choosing an equation of motion consistent with the mechanically (externally) driven fluxes, so 
arbitrary choices are often made. The aim of this work is to eliminate arbitrariness, and determine what transition 
rates are implied by the macroscopic state of the non-equilibrium system, i.e. its mean energy and flux, combined 
with our knowledge of the microscopic laws of physics. The objective is to use only the information that is available, 
without unwittingly introducing any arbitrariness, deriving from personal prejudices. The method for keeping the 
amount of information constant throughout the calculation is Jaynes' information-theoretic method of maximum 
entropy inference P, (MaxEnt) , which is often misunderstood in the context of non-equilibrium thermodynamics, 
despite recent notable achievements 0. It has been successfully used to derive fluctuation theorems and linear 
transport theory Q , and to explain self-organised criticality T\ . 

Jaynes gives a nice explanation of maximum entropy inference in his original paper on the subject P], where he 
uses the method to re-derive equilibrium statistical mechanics without the need for many microscopic details that 
had previously been considered necessary. The application of the method to equilibrium systems is uncontroversial. 
However, the history of non-equilibrium information theory can be confusing because it has been used in so many 
different ways, some of them exact, some only approximate. In fact, information theory itself is not a physical 
theory, but a mathematical method, providing a logical structure. Some physical input is required if such a method 
is to make physical predictions. If one throws away too much relevant information about some non-equilibrium 
system before applying MaxEnt, it will still provide answers, but they will be inaccurate. For instance, using the 
method to minimise the information content of the momentum distribution in a non-equilibrium gas, although 
efficacious, is not an exact method, as was recently shown 0. In fact, there is no justification for discarding 
all information content except for some averaged features. Indeed, particles possess their individual velocities 
for a reason: they have each come from somewhere, and are going somewhere, and their journeys will affect the 
trajectories of other particles. These facts are relevant to the physics of a non-equilibrium system, and lead to 
temporal correlations. 

At the other extreme, if one retains all the details of a system's phase-space trajectory, allowing no stochastic 
input (e.g. from a reservoir), then MaxEnt becomes a null procedure, since it is asked to choose the most likely 
distribution from a choice of only one physical scenario - a delta function distribution of trajectories. Such a null 
procedure may be regarded as an extreme case where MaxEnt can correctly "predict" any and all physics. There 
is thus no reason in principle why MaxEnt should be expected to fail in non-equilibrium situations, if we ask it the 
right questions. 

The choice of the prior set of options that is presented to MaxEnt is of crucial importance. It should be a set of 
physical paths through phase space, that each obeys Newton's laws, so that all physics (the Navier-Stokes equation, 
long-range correlations, etc.) is respected a priori. MaxEnt then tells us which of these trajectories is most likely 
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to be chosen, under the influence of a non-equilibrium reservoir that is coupled to the system but uncorrelated with 
it[l^. This is the application of information theory that should be understood here and in Refs. 0,I3|- I derive its 
implications for transition rates. 

Alternatively, as is often done in theoretical modelling, one can settle for a physically imprecise prior set of 
dynamical rules — such as Brownian particles, or a discrete state space, or discrete time steps — so that things 
become easy to solve. Then applying the methods below will, accordingly, yield only approximate physics, but at 
least one will know exactly what information went into the simplified model. Such an application of the present 
theory would yield transition rates that are somewhat arbitrary, due to the arbitrariness of the prior rates that 
are chosen. However, it will provide strictly the least arbitrary model. Such a model will be derived in section Hvl 
for a stochastically hopping particle, that demonstrates some features of the method. It is important to realise 
that the approximations introduced in that section are only for expediency in that particular model. The general 
derivation of the method for obtaining transition rates from prior dynamical rules combined with non-equilibrium 
macroscopic observables, presented in sections Ull and Hill remains exact. 

The conditions derived here for macroscopically driven steady states are analogous to the equilibrium principle 
of detailed balance. Like detailed balance, the conditions are not sufficient to completely determine the microscopic 
transition rates, but are necessary to be satisfied by any equation of motion that generates an unbiased ergodic 
driven steady-state ensemble. The derivation of detailed balance relies on two assumptions: time-reversal symmetry 
of the microscopic laws of motion, and the ergodic hypothesis which implies that a heat reservoir can be char- 
acterised by the Boltzmann distribution with temperature as the only parameter. Similarly, the non-equilibrium 
conditions assume the same microscopic laws that govern equilibrium motions (therefore implicitly requiring micro- 
scopic time- reversal symmetry, broken only by imposition of the macroscopic flux), and rely also on a hypothesis of 
ergodicity implying that the driven reservoir is fully characterised by its macroscopic observables (mean energy and 
flux). Many quiescent systems (those without fluxes) are at thermodynamic equilibrium, but exceptions include 
glasses 0, granular media and certain cellular automata 0, in which the ergodic hypothesis and/or 

microscopic reversibility fails. Boltzmann's law and the principle of detailed balance apply only to that class of 
quiescent systems that are, by definition, at equilibrium. That class of systems has of course proved to be large, 
significant and interesting. Similarly, not every non-equilibrium steady state should be expected to respect the 
conditions presented here; exceptions include traffic flow and fluids of molecular motors, in which the constituents 
violate time reversal symmetry. The ergodic hypothesis may also fail in some systems, implying that hidden in- 
formation that is not apparent in the macroscopic observables is nonetheless signiflcant. However, it is anticipated 
that the ergodicity criteria are respected by the transition rates of many macroscopically driven systems, defining 
a special and important class. 

The method outlined in section ^ was presented in a recent Letter j2| . It is explained here in more detail, and 
the analysis extended to an alternative non-equilibrium ensemble in section iHll The method is demonstrated in 
section IIV Al where rates are derived for the stochastic transitions of a particle hopping in a non-trivial energy 
landscape, subject to a driving force. Applications to other models are also discussed in section Hvl 

II. THE METHOD 
A. Information entropy 

Using Jaynes' interpretation of Gibbs' entropy [J, it is possible to make a "Maximum Entropy Inference" 
to assess the probability that a system, subject to random influences, (whether at equilibrium or not) takes a 
particular trajectory Fq through its phase space, thus allowing us to assess the reproducible part H), l3| of the 
system's motion. The recipe for the probability p(Tq) of trajectory Tq is to maximize the Shannon entropy, or 
information entropy, 

5z^-^p(r)inp(r) (1) 

r 

subject to constraints that some averaged properties of the trajectories conform with our knowledge of the macro- 
scopic features such as mean energy, volume, flux etc. 

In principle, this formalism gives us a full solution of the statistics of any ensemble, be it at equilibrium or not. 
In the absence of any macroscopic fluxes (i.e. at equilibrium), the prescription reduces to a maximization of the 
Gibbs entropy with respect to a distribution of instantaneous states rather than trajectories, yielding Boltzmann's 
law. In the non-equilibrium case, MaxEnt gives us the probability of an entire trajectory Fq. It would be more 
useful to have a formula for the probability of a short segment of the trajectory, a single transition from a state 
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a to a subsequent state b. Such a transition probability is what we require for designing a stochastic model or 
simulation. This would allow us to generate trajectories belonging to the non-equilibrium ensemble. Let us now 
derive that formula. We begin with some trivial calculations to establish notation. 



B. Prior probability 

At any instant t, the entire state of a system is represented classically by its phase-space position vector x{t). 
This is a high-dimensional vector specifying the positions and momenta of all the particles constituting the system. 
As time progresses from the beginning t = to the end t — tq of the experiment or simulation, x(t) traces out 
a trajectory Tq through phase space. It will prove useful to label each probability distribution function with a 
subscript indicating the duration of the trajectories to which it applies thus: pro{To). For a deterministic system 
with definite initial conditions, only one trajectory is possible, so the probability distribution is a delta function. In 
the presence of randomness, such as coupling to a reservoir of systems with similar properties, the distribution is 
finite for all trajectories that respect some prior dynamical rules, such as conservation of momentum for all internal 
degrees of freedom not directly coupled to the reservoir. 

In the absence of any posterior constraints other than normalization, 

J2pro{r) = i, (2) 
r 

all trajectories of a given duration tq have equal a priori probability. That is not an independent postulate, but is 
embodied in the maximum entropy principle of information theory since the entropy-maximizing distribution 

is given by 
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J2 { -Pro(r) inp.„(r) + Ap,„(r)} = o (3) 

r 



with a Lagrange multiplier A chosen for consistency with Eq. Equation (jSJ is solved by pTo(ro) — constant, 
indicating that the unconstrained ('prior') set of trajectories of a given duration have equal probability. 



C. Equilibrium ensemble 



We now impose a posterior constraint, and calculate the statistical properties of that sub-set of trajectories that 
respect the constraint. Let us not necessarily conserve the energy E of the system at each instant (since we allow 
energy exchange with a reservoir), but rather demand that its time- average over the whole trajectory Fg is fixed 
at Eq. We shall use a bar to indicate time averages, so that 

E^^ = - [ " Er„{t) dt = Eo. (4) 
To Ja 

Let us divide the trajectory Fq into shorter segments F, each of duration r. Then the constraint on the time- 
averaged energy may be written 

tY^E^^toEo. (5) 
r 

Assuming ergodicity, time-averages are equivalent to ensemble-averages in the limit Tq/t —* oo. So this constraint, 
for a time- average on Fq, defines the equilibrium canonical ensemble for F. In other words, the conditional probabil- 
ity pT-(F|i?ro = Eq) of encountering a particular trajectory segment F of duration r, given that the whole trajectory 
has a time-averaged energy Eq, is found by maximizing the information entropy for F subject to Eq. ((SJ. The 
maximization involves Lagrange multipliers /3 for this energy constraint, and for the normalization constraint, 
and yields 

pAT\Eo)^ Z-^cxpi-f3E^) 

where the condition Er^ — Eq is represented for brevity by Eq. As expected, this is Boltzmann's law, and we inter- 
pret the Lagrange multipliers as the temperature parameter /3 = l/kgT and partition function Z = exp(— /JSp)- 
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D. Transition rates 

A transition rate, for any transition between states a b say, can be written as a conditional probability. If 
we consider a trajectory segment F' of duration At, representing the transition a ~> b, then the transition rate at 
some time, which we may define without loss of generality to be t = 0, is 

<'"=PAt(r'|a;(0) = a)/Ai. (6) 

This is the probability (per unit time) of encountering the trajectory V = {^^(O) = a, x{At) — 6}, given that we 
begin at a. Equation © gives the prior rate of the particular transition. The rate in the equilibrium ensemble is 
given by a probability subject to two conditions: 

LuP=pAt{r'\a,Eo)/At (7) 

where the condition x{Q) = a is represented for brevity by a. To re-cap, Eq. {Tj) defines the probability of 
encountering trajectory segment T' (a transition a ^ h) given that we are in state a, and that the entire trajectory 
Fq of duration tq will turn out to have a mean energy Eq. 

E. Driven ensemble 

We have looked so far at the prior phase-space trajectories, and those for systems in the equilibrium ensemble. 
Our goal is to determine the transition rates appropriate to a non-equilibrium ensemble, for which there is an 
imposed flux J. Again, we should not over-constrain the dynamics. Let us allow the flux to fluctuate, and demand 
only that the dynamics will result in some finite value Jq of the flux time-averaged over the whole trajectory: 

1^,^-1 Jroit) dt = Jo. (8) 

To Jo 

We ask, what is the probability, in time At, of encountering the transition V = {a ^ b}, given that we begin in 
state a, and that the dynamics will eventually conspire to produce a mean flux Jq and energy Eof Again we relate 
this conditional probability to a transition rate: 

uj^'i =PAtir'\a,Jo,Eo)/At. (9) 

Fig.^depicts some of the trajectories that have been discussed. Time t is shown on the horizontal axis, and all 
trajectories have a total duration tq. The vertical axis represents the phase-space coordinates though, of course, 
this is a reduced representation of the vastly high-dimensional phase space, since it has been projected onto a single 
axis. For deflniteness, let us say that this axis represents integrated flux, i.e. the flux that the system has accrued 
since t — 0. We must imagine that all the other coordinates required to fully describe the state of the system, are 
on axes perpendicular to the page. 

A sample of trajectories representing the equilibrium distribution is shown (in grey and black). These trajectories 
are concentrated close to the time axis (zero flux). If another axis measuring instantaneous energy E{t) were 
constructed perpendicular to the page, then the density of trajectories would be exponentially distributed along 
that axis, by Boltzmann's law. Equation Q gives the frequency of observing a particular trajectory segment shown 
in Fig. n (the single transition a — > 6) of microscopic duration At, given that we are currently (at t = 0) in state 
o, and that the whole trajectory belongs to this equilibrium set. Equation (j^ asks for the frequency with which 
that trajectory segment {o — > 6} occurs in the sub-set of trajectories shown in black in Fig. ^ for which a given 
integrated flux will be accumulated by time t = tq. This sub-set of trajectories is the driven ensemble. 

Note that we shall not require At to vanish. The discussion will cover discrete-time processes for which the 
microscopic time step is At = 1, as well as continuous-time dynamics for which At — > 0. 

F. Bayesian evaluation 

To mathematically manipulate conditional probabilities, we appeal to Bayes' theorem. It states that the joint 
probability of two outcomes X and Y both occurring, given a third fact Z, may be written in two equivalent ways: 



p{X\Z)p{Y\X, Z) = p{Y\Z)p{X\Y, Z) 



(10) 
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FIG. 1: A sample of phase-space trajectories of duration to- One phase-space coordinate (cumulative flux) is shown as a 
function of time t. At equilibrium, most trajectories lie near the horizontal axis. The sub-set of trajectories shown in black 
belongs to the microcanonical-flux ensemble. Their time-averaged flux is Jo. The probability of a transition, in microscopic 
time At, between the phase-space points a and b shown, is enhanced for this sub-set, relative to its equilibrium likelihood. 



where p is simply used to mean 'probability' for any event (appropriately normalised), as opposed to a particular 
distribution function. We can now assign the following meanings: X is the fact that the transition a b takes 
place within At, represented by the trajectory F'; Y says that the flux has a mean value J^g — Jo averaged over 
the entire duration tq; and Z is the combined statement that the initial state at t = is a and that the trajectory's 
time-averaged energy will be Er„ = Eq. Thus, Eq. (|10|) expresses the probability of the transition taking place 
within At and the flux averaged over tq being Jq, for the given initial state and average energy. It is re- written 
thus: 

Pro{Jo\a, Eq) pAt{T'\a, Jo, Eq) = pAt{T'\a, Eq) pro{JQ\a,r' , Eq). (11) 

Notice that it is redundant to specify the two conditions {a,T'), since the trajectory segment V is the transition 
a b which includes the initial state a. Substituting from Eqs. Q and yields a theorem for transition rates in 
the driven steady-state ensemble, 



,dr _ , cq p°q(Jo|a ^ b ) 

ro^oo p^[J(Jo|a) 



Notice that all quantities on the RHS of Eq. (|12|l are defined at equilibrium, not on the driven ensemble. This 
is indicated by the superscript 'eq', which is equivalent to the condition fixing Er^, the time-averaged energy. 
Equation (|12l) tells us that the transition rate in the driven ensemble is given by the transition rate in the equilibrium 
ensemble, multiplied by an enhancement or attenuation factor. We shall see below that the theorem makes intuitive 
sense. 

Given that the dynamics must be consistent with the macroscopically observable mean energy and flux, and 
with the same microscopic laws of motion that hold sway in an equilibrium system, MaxEnt yields an unbiased 
description ofthe dynamics, and thereby constrains the system the least. Eauation H12|l specifies explicitly the 
dynamical rules implied byMaxEnt. How do we know that Eq. H12|) constrains the dynamics the least? It does, 
because all the quantities on the RHS are defined for the maximum-entropy ensemble at equilibrium, i.e. without 
the extra constraint on the flux. Given that we start with an unbiased set (the equilibrium ensemble), Bayes 
theorem gives us the least biased set subject to the extra posterior constraint. 

Let us examine the enhancement factor in Eq. H12|) in detail. It is a ratio of conditional probabilities for 
encountering a flux Jq in the equilibrium ensemble. Of course, we do not expect a system at equilibrium to exhibit 
any net flux, averaged over its whole trajectory. The chance of such a flux arising spontaneously at equilibrium 
is vanishingly small (as tq oo). So the RHS of Eq. H12(l is the ratio of two vanishingly small terms. However 
unlikely it may be for an equilibrium system to spontaneously exhibit the desired macroscopic flux, we ask, how 
much would that probability be enhanced as a result of the putative transition a — > &? If the dynamics of the 
transition itself contributes some flux to the trajectory, it is favoured by the enhancement factor. The factor also 
favours transitions to configurations that give a greater than average probability of subsequently obtaining the 
desired flux, for the given starting point. If the new state b is more likely to initiate high-flux trajectories, then the 
transition rate is boosted over and above the equilibrium rate. 



6 



We shall examine the implications of Eq. (|12|l in some examples, but firstly let us interpret the meaning of its 
derivation. Imagine that a lazy physicist wishes to collect data from a driven steady state, such as continuous shear 
flow of a complex fluid. Our physicist has a computer program that simulates the fluid at equilibrium (with free 
or frictionless boundaries, say), and is too lazy to write a new program that simulates shear. Instead, (s)he runs 
the equilibrium simulation, in the hope that it will spontaneously exhibit shear flow. It does not. So the dilettante 
updates the program's random number generator and runs it again. Having tenure and little imagination, the 
physicist repeats this process countless times until, one day, the fluid fluctuates into a state of sustained shear flow. 
The delighted simulator records this fluke, but continues the project for many more years until a large number of 
such accidents have been observed, exhibiting the same shear rate. Finally, the researcher discards an enormous 
set of simulated trajectories, and publishes only that subset which happened to perform the desired shear. On 
analysing this subset of trajectories, one might expect to observe the equilibrium transition rates that were coded 
into the algorithm. But this is a biased data set, subject to an a posteriori constraint of shear flux Jq. So this sub-set 
of the equilibrium ensemble exhibits exactly the transition rates specified by Eq. (|I2|I . Although the programmer 
has published a biased account of the equilibrium simulations, there was no unwarranted or subjective bias other 
than the flux constraint, hence the project was a success in producing the physics of shear flow. 

Note that, despite extracting a sub-ensemble from the equilibrium ensemble, the lazy physicist has not produced 
a near-equilibrium approximation. The "sub-ensemble dynamics" of Eq. (|I2|I has features qualitatively different 
from the equilibrium dynamics. 

In section llVl I shall use some examples to demonstrate the correctness of the physics generated by sub-ensemble 
dynamics (Eq. H12|l'). Before doing so, in section UTTl I develop a useful variant of Eq. I|12|l . analogous to an 
alternative thermodynamic ensemble. 

III. ALTERNATIVE DYNAMIC ENSEMBLES 

A. Microcanonical-flux ensemble 

Equation p2|l gives the frequency of observing a particular trajectory segment (e.g. a single transition a ^ b) 
of microscopic duration At, in the driven ensemble which is a sub-set ofall trajectories, shown in black in Fig. ^ 
These trajectories lie in the extreme tails of the equilibrium distribution. Note that they have common end points, 
since we have specified the exact net flux that must flow during the duration of the experiment. Any nearby 
trajectories, that do not have exactly the specified flux, do not contribute to the quantities appearing in Eq. H12I) . 
Even very nearby trajectories are completely discarded by Eq. (|I2|) . This can be seen by re- writing the probability 
of thespecified flux Jq as a sum over trajectories To with fluxes Jp,, , so that Eq. H12() becomes 

/dJp",q(J|a^6) S{J-Jo) 

J dJ p-'Ma) SiJ - Jo) ^ ' 

EroPr:?(ro|a-^b)^- Jo) 
J2r„Pro(^o\a) 5(7^ - Jo) 

Here, the Dirac delta functions kill all trajectories with anything but the exact net flux Jq. This can be a 
disadvantage for practical applications of the formula. (The lazy physicist, discussed above, must discard data 
even from simulations that produce nearly the right flux.) An alternative expression is now derived, that samples 
trajectories with less stringent conditions on their eventual flux content. 

B. Canonical-flux ensemble 

In equilibrium statistical mechanics, the constraint of energy conservation is relaxed by dividing the isolated 
microcanonical system into a relatively small sub-section, defining the canonical system, and the large remainder, 
known as the reservoir. Similarly, we shall relax the strict constraint on the time-averaged flux, by dividing the 
total trajectory of duration tq into a part (see Fig. ^ of duration t (where At <^ t <^ tq), whose properties 
are examined in detail, and the large remaining part of duration f = tq — r, for which the system's motion is 
uncorrelated with the earlier trajectory segment. 

We may express the conditional probability Pro(Jol'i) of a net flux Jq in the full duration tq, as an integral over 
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all possible fluxes during the interval r thus: 

/oo 
AJp%^{J\a)pf{J\a,J) (14) 
-oo 

where pf{J\a, J) is the probability of an appropriate flux J during interval f given that the system began in state 
a at t = 0, and then flowed with mean flux J for the duration r. The required flux J is given by 

tJ + tJ^tqJq. (15) 

Given that r exceeds any correlation time Tcor, the probability pf(J|a, J) becomes independent of a, because the 
system has forgotten its initial state by the time r at which the interval f commences. In fact, at time r, the system 
is in a state drawn at random from the driven steady-state ensemble, since the integral in Eq. H14|) is dominated 
by J ~ Jq. So we may make the replacement 

p^(J|a,J)^^/^(c)p?'>(J|c) (16) 

c 

where p'^^{c) is the steady-state distribution of instantaneous microstates in the driven ensemble. Not only is the 
above formula independent of the initial state a, it actually takes a universal (exponential) form as a function of J, 
as shown in Appendix ^ using the theory of large deviations. This is because the extremely unlikely value of the 
flux, J, is the result of many unlikely realisations of the flux during the many uncorrelated intervals that comprise 
the large duration f. As a result, Eq. ({T^ can be re-cast, using Eqs. (fTIj) . ({TB|l and lfTB|l . and the derivation in 
Appendix 1X1 as 



:\ r/.„..^oo /dJp^^(J|a)e-^ 



where the control parameter v is conjugate to the time-averaged flux, and is flxed by the relation 

dQ 



in terms of the function 



g(i.) ^ lini _l_JdL, (19) 

Here, (. . .)(ir is an ensemble average with respect to the steady-state distribution of microstates p^^{c). We have 
defined 

/oo 
p^'i(J|c)e^''''dJ (20) 
-oo 

that is a property of an instantaneous state c of the system. Note that mc{v,T) has non-trivial r-dependence, 
containing transients for r < Tcor, and becoming linear in r for r ^ Tcor, while Q{v) is independent of t. 

The above equations have a structure that is familiar from equilibrium thermodynamics. Clearly, in Eq. H19I) . Q 
plays the role of a thermodynamic potential, audits derivative J is conjugate to the temperature-like parameter v. 

The conditional probabilities in the integrands of Eq. I|17|) describe the likelihood of any particular flux during 
the interval t, given the initial state and/or transition. The exponential factor measures the change in the weight of 
the large remainder of the trajectory of duration f, due to the initial part accepting a flux J rather than postponing 
it until after t. 

Compare Eqs. and H17|) . The expressions become very similar under a change of notation t ^ tq. The 
difference in the new formulation (Eq. (|17() ') is that trajectories with the wrong flux are not eliminated by a delta 
function, but re-weighted by an exponential weight factor. 

The two alternative formulations are exactly akin to alternative ensembles in equilibrium statistical mechanics. 
We can regard the duration of a trajectory as being analogous to the size of a system at equilibrium, and the 
flux as analogous to energy-density. Originally we demanded that the integrated flux was flxed exactly, just as 
energy is fixed in the mico-canonical ensemble, and we enquired, in Eq. \V6\ . about how the instantaneous ('local') 
conditions are affected by correlations in the rest of the trajectory ('system'). The formulation of Eq. H17|) is akin to 
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using the canonical ensemble. Again, we enquire about conditions at an instant ('locality') At, within a trajectory 
segment ('system') of duration ('size') r. But now, the integrated flux ('energy') is not strictly conserved, but can 
be exchanged with the rest of the trajectory ('a reservoir') of duration ('size') f much longer ('larger') than the 
initial trajectory segment ('system'). Since all important correlations are contained within the 'system', the nature 
of the interface between 'system' and 'reservoir' becomes unimportant, and the 'reservoir' is characterised by a 
single parameter, v. Let us refer to this as the'canonical-flux' ensemble. So long as the trajectory duration ('system 
size') is much greater than any correlation time ('length'), the properties at instant ('locality') Ai are unaffected 
by whether integrated flux ('energy') is exactly conserved, and the ensembles are equivalent in the infinite-time 
('thermodynamic') limit. 

It is possible to derive Eq. (|17() [via Eqs. ((7|) and ^] by direct maximization of the information entropy of a 
set of trajectories, at fixed ensemble-averaged flux and energy. In that case, as with the above derivation, great 
care is required to compare the relevant time-scales with correlation times, to avoid unwittingly averaging over 
the correlations present in p°'^( J|a b). That would produce a mean-field expression, in which the rate of each 
transition is simply boosted exponentially according to its immediate flux contribution. Such a scheme is popular 
in simple models, but should not be mistaken for the exact theorem derived above. 



C. Factors affecting transition rates 

The expression for transition rates, Eq. H17|l . appears to depend on the arbitrary quantity r. It can be re- written 
in an a much clearer form that is explicitly independent of r, as we now show. 

Although mc{v,T) ^ oo as r — > c», the difference mi, — nia, for two states a and b, has a finite asymptote, 
embodying the different transient influences that the two states have on the system, before it returns to a statistically 
steady state. So, let us define a function that contains that transient information, but is independent of the 
arbitrary quantity t, thus: 

Qaiv) = lim {r7io(i/,T) - r(5(j/)} (21) 
= In lim (22) 

\ / dr 

1 /p°q(J|a)e-^dJ 
-l\l-./(,-(J)),,.e-dJ ^''^ 

so that Qa — Qb = — rub in the long-time limit. 

We require one further piece of notation. The dynamics is described by a set of transitions a b carrying 
integrated flux Jat At. For continuous dynamics. Jab At ^ as At ^ 0, but for discrete transitions. Jab At 
remains finite whether or not time steps are made vanishingly small. As above, the following discussion applies to 
either case. 

In terms of these physically meaningful quantities, transition rates in the driven ensemble are given by 



exp [vJab At + qbiv) - qa{v) - Q{v)/S.t] 



^f-^b = '^alb exp [vJab At + qb{v) - qa{v) - Q{v)M] . (24) 



The derivation of Eq. H24|) from Eq. (|17|l is given in Appendix ^ It is now clear, in Eq. H24|l . that three distinct 
factors determine the rate of a transition a — > 6 in the driven steady-state ensemble. (1) The rate is proportional 
to the rate at equilibrium. So, all else being equal, energetically expensive transitions are slow, while down-hill 
transitions take precedence. (2) The rate is exponentially enhanced for transitions that contribute a favourable flux. 
(3) The dependence on gt, — qa is overlooked by mean-field models. It says that a transition's likelihood depends 
also on the state in which it leaves the system. Its rate is enhanced if it puts the system into a state that is more 
likely to exhibit flux in the future. The effect of this factor on the driven steady-state distribution of microstates 
is to increase (relative to the Boltzmann distribution) the weight of states that are more-than-averagely willing to 
accept a future flux. We shall see an example of this effect in the next section. In the case of a shear flux, this 
means that low- viscosity states are favoured, as is often observed. 



IV. APPLICATIONS 



We have a recipe for constructing a model of any given driven system, that is guaranteed to yield the desired 
flux, and to respect all the physical laws that are obeyed by the equilibrium version of the model, and that is 
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Energy 
A A 

E 

V 

Position X 

FIG. 2: Comb-shaped state-space for the hopping model. Circles represent the possible states of the system, of types a and 
p. The single particle, shown as a filled circle, currently occupies an a state. It may escape into other a states to the left 
or right with rates L and R respectively, or downwards in energy, with rate D, to state /3. Each /3 state can be exitted with 
rate U upwards to the a state at the same location only. 



guaranteed to have no artefacts from statistical bias. If we choose to provide this machinery with an equilibrium 
model that obeys all of Newton's laws — i.e., a fluid whose internal interactions conserve momentum, angular 
momentum, and energy, while stochastic forces from the reservoir couple only to particles at the boundary — 
then it will yield dynamical rules that also respect Newton's laws for the boundary-driven fluid. In other words, 
the method has the capacity to produce exact physics if provided with an exactly physical prior. It provides a 
description of the reservoir, by characterising the stochastic part of the equations of motion. The way in which 
that reservoir couples to the system is up to the user to decide. In the above example, it is coupled only at the 
boundary, but we may instead consider a driven Brownian system, for which the heat bath is more strongly and 
uniformly coupled, dominating all momentum variables. Another alternative is to apply the method to a model 
whose prior (equilibrium) physical properties are simplified for the sake of clarity and analytical expediency. In 
that case, of course, the result of the recipe will be approximate and unreliable, but still the least arbitrary choice 
of transition rates for the given degree of simplification. 

The micro-canonical flux ensemble introduced in sections ^ and IIII Al was first presented in Ref . (3] , where it 
was used analytically to construct a continuum model of driven diffusion, and heuristically to discuss the features 
of a lattice model of dimers under shear. The latter model had a much more complex energy landscape including 
jammed states. Another analytically solvable model was constructed in Ref. 8], using the micro-canonical flux 
method. It was another one-dimensional driven diffusion model, but this time with a discrete state space and 
discrete time step. In the following section, we shall analytically construct a model of a driven system with a 
non-trivial energy landscape, that demonstrates some features of more complex systems, such as sheared complex 
fluids, with states that are locally trapped so that they cannot easily be driven. The model reduces to simple one- 
dimensional driven diffusion in a certain limit, and has a discrete state space but continuous time, to complement 
the earlier published models. We shall use the canonical- flux ensemble of sections IlII Bl and IIII CI to demonstrate 
the utility of this method. 

A. The model 

Consider a particle that can hop stochastically among a set of discrete states that have the connectivity shown 
in Fig. |2J The particle will be driven by its non-equilibrium heat bath so that it has, on average, a drift velocity u 
from left to right. At each location x, it may occupy one of two states: state a, from which it may escape to the 
left or right with rates L and R to other a states at different locations x, or downwards with rate D into a lower 
energy trapped state /3; once in a /3 state, the particle cannot exhibit any flux, i.e., cannot move left or right, but 
can only wait for a random excitation at rate U back up to the a state at the same location. 

Note that, if we set Z) = 0, the model reduces to a continuous-time discrete-space linear hopping model, like the 
versions that were previously studied with both space and time continuous ^ or discrete 

The equilibrium version of this model (with no mean drift) is very straightforward. Detailed balance requires 
that V"^ = D'"^ ex-p{—E), where E is the energy difference between states a and jS measured in units of fc^T, and 
that i?'^'^ = L^"^ = ujQ where we may measure all rates in units of ujq so that ujq = 1 without loss of generality. The 
occupancy of a states is given by Boltzmann as 1/(1 + cxp(i?)), and the only remaining parameter that we are free 
to choose is 0°"^ = p, which specifies the ratio of vertical to horizontal mobilities. 

When the model is not at equilibrium, but is driven at drift velocity u, the nai've expectation would be either 
that we a free to choose all four rates U, D, L, R, since non-equilibrium models traditionally have no rules, or that 
detailed balance still governs the ratio U/D. However, as discussed above, neither of these statements is true. 
There is, in fact, a least-arbitrary set of rates, that corresponds to driving by an uncorrelated heat bath that is 
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characterised only by its temperature and velocity. We shall now calculate that set of rates, using the canonical-flux 
ensemble. 

The rates are given by Eq. H24I) . Defining our unit of length to be one inter-site spacing, the integrated flux of a 
transition to the right (left) is Jr At — 1, ( At = —1), while transitions between a and P states carry no flux as 
they leave the particle's displacement unchanged. Since time is continuous, the time-step is infinitesimal. At — > 0, 
so that the last term in the exponential of Eq. (|24l) vanishes, and it prescribes the following rates in the driven 
ensemble: 

R = e'' (25a) 
L = (25b) 
D ^ pe^"-'" (25c) 
[/ ^ C/cqg,„-,3 (25d) 

To complete the calculation of the rates, we require only qa — qp, the difference in the willingness of the two states 
to admit a flux. This could be evaluted by "brute force" using Eq. (|23|) , if we first calculate the Green function 
for the equilibrium model, i.e. the probability that the particle travels a given distance in a given time, given the 
intial state a or /?. However, that calculation can be avoided, using the derivation in Appendix lUl to show that, for 
this "comb" model, 

qo,iiy)-qp{,y)^Hl + Q{,y)/U^^). (26) 

This is purely a result of the facts that state /3 can only be quitted via state a, and that escape times are distributed 
exponentially. 

We can now construct a differential equation for Q{v), as follows. Due to the model's translational symmetry, 
the steady-state occupancy of a states is just 

/a = — ^ (27) 

and, since displacements are allowed only from a states, the mean drift velocity is 

u={R-L) /„ = 2/„ sinh V. (28) 

Now, using Eq. (|18|) . we obtain an ordinary differential equation, 

dg _ 2(pe--^-|-Q)2sinhi/ 
d^7 ~ p^e-E + {pe-E + QY 

that can be integrated for cosliv as a function of Q. The constant of integration is fixed by Q(0) = which follows 
from normalization of the probability distribution in the definition of Q [Eqs. H19() and H20|l ]. Finally, we obtain 
the required "potential" , 



Q{v) = cosh{v) - 1 - ^(1 + e-^) + \j cosh(t/) -1-^(1 



2 



P, 



2 



which, with Eqs. (|25|l . (|26|1 . H27|l and (|28|l . leads to four constraints on the four transition rates in the driven system, 
from which the abstract quantities v and Q have been eliminated: 

R-L = {l + DlU)u (29a) 
RL = 1 (29b) 
UD ^ p2g-B (-29c) 

R + L + D-U = 2 + {l-e~^)p (29d) 

One of these four equations is obvious; the others are not. Equation (|29a|l is simply a re-statement of Eq. 1)28(1 . 
and gives the drift velocity u that results from any choice of the four transition rates. So, if we applied the usual 
ad hoc construction of non-equilibrium stochastic models, we would pluck four rates out of the air, use Eq. I|29a|l 
to find the resulting drift velocity, and have no other constraints. The other three constraints have arisen from our 
demand that the design of the model incorporates the prior dynamics, the large-scale flux, and no other design 
features. 
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FIG. 3: Rates R, L, U and D for the driven comb model, as functions of the drift velocity u, using the parameters E — 2, 
p = 0.5. 



Note that Eqs. (|29b|l and (|29c(l express relations between forward and reverse transition rates that are generic 
to any continuous-time model with instantaneous transitions, that the product of the forward and reverse rates of 
a transition is equal in the driven and equilibrium ensembles. This follows directly from Eq. \'2,A\ with Jah Ai finite 
as M 0. 

The four transition rates defined by Eqs. H29|l have exactly the same number of free parameters as in an equilib- 
rium model: for a given energy gap E and flux u, the four rates are defined up to one parameter, p, that specifies 
the prior ratio of vertical to horizontal mobilities, as is the case in the equilibrium version of the model that was 
required to respect detailed balance. While the equilibrium occupancy, given by Boltzmann's law, is independent 
of p, the occupancy in the driven ensemble [(Eq. (|27|) ] does depend on this kinetic parameter. 

B. Properties of the model 

The transition rates prescribed by sub-ensemble dynamics are plotted as functions of velocity u in Fig.|31for an 
energy gap E — 2 and mobility ratio p — 0.5. Due to the symmetries of the comb structure, the rates of transitions 
up and down {U, D) between a and (3 states are even functions of u. At u = 0, the rates take their equilibrium 
values, R = L = 1 and D = UexpE. On increasing velocity, hops to the right (i?) become more frequent, while 
hops to the left (L) are suppressed, as expected. Also the particle becomes less likely to fall down (D) into a 
trapped /3 state, and is increasingly dragged out of traps (U) by the driving force. 

The rates are re-plotted on log-log axes (for positive u) in Fig. 01 using parameter values E = 6, p = 100, that 
were chosen to provide a separation of time-scales, emphasizing the features of the graphs. Three regimes of drift 
velocity u become apparent. On the left of the figure (low u) is the near-equilibrium regime, where the rates D, U, 
of transitions that do not carry a flux, remain approximately constant, respecting detailed balance. This fulfils the 
nai've expectation often applied to non-equilibrium models, that detailed balance continues to describe the physics 
of activated processes. Meanwhile, the rate of hops to the right, R, is enhanced and to the left, L, is suppressed, 
so that the sparsely populated a states exhibit the required drift velocity. 

The second regime of the driving velocity, luq < u < p is shaded grey in Fig. ^ In this regime, the flux 
constraint can no longer be satisfied by the small population of thermally-activated a states. The states become 
mechanically activated, with particles in the immobile (3 state promoted into the mobile state by the driving force. 
As u increases through the shaded part of the figure, the unequal hopping rates to the right and left remain 
approximately constant, while the rate of activation U increases and rate of trapping D decreases, so that the 
drifting a population increases. This is also apparent in Fig. |S1 which shows the occupancies of the two states as 
a function of velocity, for this same set of parameters. 

Once the mobile states are fully populated, and the trapped states have negligible occupancy, the bias on hops 
to the right can no longer remain constant while satisfying an increasing flux constraint. Hence, a third regime 
exists at the highest values of u (Fig. 0)), where rates R and U both become proportional to the flux u, while the 
flux-impeding transitions have rates L and D inversely proportional to u. 
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FIG. 4: The transition rates of the driven comb model, shown on log-log axes as functions of the positive drift velocity u. 
Parameter values are E — 6, p — 100. Three regimes of behaviour are visible. The regime at intermediate values of u is 
shaded. 
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FIG. 5: Occupancies of a and /3 states for the same parameters as in Fig. 0J shown as functions of the drift velocity, on 
semi-logarithmic axes. Three zones of behaviour are again visible: Boltzmann-distributed states at low u; re-population of 
states at intermediate u; the mobile a state fully occupied at high it. 



C. Comparison with shear flow 

The model we have studied here, with its simple comb-shaped state space, has some features that are generic to 
driven systems. We saw, in section llll CI that the rate of a transition in a driven ensemble depends on three factors: 
its rate at equiUbrium, the amount of flux that it contributes, and the difference in the wiUingness of the initial 
and final states to allow the required flux in the future. Transitions that contribute a non-zero amount of flux were 
called "type A" in Ref. 0, while transitions between states with different promise for future flux where labelled 
"type B" . In previous articles 1^ , the rates prescribed by sub-ensemble dynamics were calculated explicitly only 
for simple models, that exhibited only type A transitions due to the simplicity of their state spaces. The comb 
model, on the other hand, has both type A (a ^ a) and type B (a ^ /3) transitions. Another example of such a 
model, that was previously discussed only heuristically 0, Q , is a set of dimers (particles that occupy two adjacent 
lattice sites) that perform random walks on a two-dimensional triangular lattice, while the lattice itself is driven 
into shear flow by sporadically cleaving and re-positioning its horizontal layers. Certain arrangements of the dimers 
(analogous to a states of the comb model) allow these quanta of shear, while other states (analogous to (5 states) 
are prevented from shearing, due to the unbreakable dimers straddling two layers of the lattice, thus jamming the 
system. 

Although any such many-particle system has a very complex state-space, its crucial features are reproduced in 
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the comb model. When the comb model is stuck in a /3 state, the driving force (that derives from the statistics of 
the driven ensemble) pushes it into a more mobile state in order to flow. Likewise, when the dimer model is in a 
state that will not admit a flux, it must first re-arrange its particles. The driving force achieves this by imposing 
a shear stress on the particles, causing them to re-orient mechanically (as opposed to thermally, by Brownian 
motion). The sub-ensemble rules prescribe (for a given prior dynamics) the rate of that mechanically imposed 
re- alignment, thereby specifying the constraints that must be met by a physically acceptable constitutive relation 
for the flowing system. 

V. SUMMARY AND OUTLOOK 

There are certain constraints that must be satisfied by any candidate for a statistical mechanical theory of driven 
steady states: it must satisfy the known laws of motion, and it must give rise to the required macroscopic observables 
(flux, energy etc.). In this article, we have assumed; indeed, demanded; that those are the only constraints, and 
derived the transition rates implied by that assumption. Comparison with experimental observations will determine 
a posteriori whether a particular system belongs to the ergodic class that is well described by these unbiased rates, 
just as empirical comparison determines whether or not a static system is at thermodynamic equilibrium. If one is 
privy to prior information indicating that the driven system's motion is biased in some way that is not apparent 
in its macroscopic flux and conserved quantities, then the dynamical rules set out here should be disregarded. To 
violate the rules a priori without such a justification is to bias the model with arbitrary information derived from 
prejudice rather than from physics. Such arbitrariness is not condoned for equilibrium models, and the same should 
be the case for macroscopically driven steady states. 

For example, consider how we design a stochastic model of an equilibrium system. The system is defined by some 
set of available states, and we must choose the rates of transitions between those states. Canonical equilibrium 
is defined by a fixed volume, particle number, and mean energy of the system. We might choose any arbitrary 
set of rates, and then measure or calculate the mean energy that results when the system arrives at a steady 
state. Certainly, that procedure would give rise to a well defined mean energy, volume and number, but that is 
not sufficient for us to say that the system is at equilibrium and that the transition rates are acceptable. There 
are constraints arising from the principle of detailed balance, which ensure that (E, N, V) are the only parameters 
characterising the macroscopic state of the ensemble, beyond the definition of the system in terms of its accessible 
states and reversibility. We have found the generalisation of those constraints to non-equilibrium steady states. 

The prior is central to the formalism, and is often misinterpreted in non-equilibrium applications of information 
theory. In the present context, it is used to mean the complete set of physically valid paths that a system might 
take in response to the stochastic forces arising from a particular coupling to a non-equilibrium reservoir. If the 
reservoir can exchange energy with the system, then conservation of energy can be violated in the prior. If the 
coupling is only to particles at the system's boundary, then energy, momentum and angular momentum must be 
conserved by all internal interactions, so the prior does not include scenarios for which those laws are violated. 

This has not been the usual definition of the prior, in previous attempts at non-equilibrium applications of 
information theory. It is often assumed that our knowledge of microscopic dynamics can be discarded, and MaxEnt 
will correctly reconstruct that missing information. Such optimism cannot be justified. For instance, MaxEnt has 
been used to choose between phase-space paths that are characterised by their actions 0| , discarding our knowledge 
of Hamilton's principle of least action. The result is an exponential distribution in which the paths of least action 
are the most likely, but that result is incorrect. Paths on which the action is extremized are not just likely; they 
are the only paths of a classical system, and therefore the only paths that should appear in the prior if an exact 
calculation is wanted. 

The central results of this paper are the formulae for transition rates in a driven ensemble. These are formulated 
in two alternative ways. In the "microcanonical-flux ensemble" , the flux is constrained to an exact value when time- 
averaged over the duration (tending to infinity) of each system's passage through phase-space, resulting in Eq. H12() . 
The "canonical-flux ensemble" , in which only the ensemble-averaged flux is constrained, leads to Eq. 1)24(1 for the 
transition rates, which is exactly equivalent to the microcanonical-flux prescription. The canonical-flux equation 
H24|l makes explicit the three factors influencing a transition rate. As at equilibrium, energetics are important, 
making a system reluctant to take up-hill steps in its energy landscape. Secondly, an exponential factor, that one 
might have guessed, favours transitions that impart the desired flux. The third factor prescribed by Eq. (|24|) is 
more subtle. It describes the importance of correlations, and depends on a well-defined quantity ascribed to each 
microstate, that quantifies its promise for future flux. A transition is favoured if it takes the system to a state of 
higher promise, that is more amenable to future flux-carrying transitions. 

The sub-ensemble scheme has previously been demonstrated to produce the standard equations of motion for 
diffusion with drift, both for continuous 0] and discrete random walks. In the present article, the dynamical 
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rules were evaluated for a more complex model. We have seen that, for thermally activated processes, that are 
governed by detailed balance at equilibrium, the sub-ensemble rules describe mechanical activation by the driving 
force, although detailed balance is recovered in the low flux regime. 

In the context of shear flow, mechanical activation corresponds to stress-induced re- arrangement. The fact that 
this statistical formalism describes the effects of non-equilibrium stresses in a natural way, makes it a promising 
approach for the study of shear-banding, jamming, and other shear-induced transitions of complex fluids. 

At the risk of repetition, we have a recipe for constructing a model of any given driven system, that is guaranteed 
to yield the desired flux, and to respect all the physical laws that are obeyed by the equilibrium version of the 
model. It is also guaranteed to have no artefacts from statistical bias. This machinery has the capacity to produce 
exact physics if provided with an exactly physical prior. Otherwise, it will yield the least arbitrary model for the 
given degree of approximation. 
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APPENDIX A: THE CANONICAL-FLUX POTENTIALS 

As stated in Eq. 1)16(1 . the distribution of flux J during interval f, that appears in Eq. 114|l . is uncorrelated with 
the initial state a, and can therefore be written 

Pf{J\a, J) ^ MJ) ^ J^p'^'^lc) p^'>(J|c) (Al) 

c 

in terms of the instantaneous steady-state distribution of states p'^'(c). The distribution ff{J) can be evaluate if 
we sub-divide the interval f into n sub-intervals of duration t, where n ^ 1 since f tq 3> t. The system begins 
each of these sub- intervals in a state drawn randomly and independently from the steady-state distribution p'^^{c). 
These initial states are uncorrelated because t » Tcor- The overall flux in the interval f is the mean of the fluxes 
in these n independent sub-intervals, so that 

/oo \ ^ 

dJi . . . dJ„ fr{Jl) . . . fr{Jn) -^l^ " " ^ J^) ■ (A2) 
-oo , 

This limit distribution gives the likelihood (under equilibrium dynamics, with a non-equilibrium initial state) that 
the n independent flux measurements have an improbably-large mean value J. Gamer's theorem of large deviations 
|l.l| | states that the weight in the tail of the distribution of the mean of n independent identically distributed random 
variables behaves as 

1 f°° 

lim -In / fnr{J')dJ' ^ ~I{J,t). (A3) 



(A4) 



That is, the weight in the tail decays exponentially with n, at a rate / given by 



/(J, r) = sup 



/oc 
-oo 



Dividing both sides of Eq. I|A3|I by the constant t gives 

lim \ In r ff{J') dJ' - Zlldlll, (A5) 

f/r^oo T J j T 

Since the LHS of Eq. I|A5|) is independent of the arbitrary choice of r, we can infer that / oc t. Let us define the 
function 

H{J)=I{J,t)/t (A6) 
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that is independent of the arbitrary quantity r. Writing the exponential decay law explicitly, with an unknown 
prefactor A{J) that varies only slowly with f, 

/f(J')dJ' ^ A{J) exp -tH{J) as r/f ^ 

and differentiating with respect to J gives 

ff{J) ^ \a{J)tH'{J) - A'{J)] exp \~tH{J)] . (A7) 

Now, substituting for J from Eq. H15() and Taylor-expanding H{J) to first order in r/f allows us to take the limit 
f/r — > oo when substituting Eqs. l(TI|l . (|A1(I and HA7|I into ifT^ . yielding 



UJ 



dr 



lim 



(A8) 



The supremum in Eq. ljA4p can be evaluated by defining the functions in Eqs. (|19f) and H2U|I . From Eqs. (jAlj) . 
4]) and ljA6p . we have 



H{J) = vJ -Q{v) 



(A9) 



with the parameter i/(J) [equal to 6'/r in Eq. (|A4|) ] given by Eq. H18|) . Thus the parameter H'{Jq) in Eq. IjASp can 
be evaluated by differentiating Eq. (|A9(I and substituting from Eq. H18|l . to give 



dH 



(AlO) 



resulting in Eq. H17|l . Note that Q is a Legendre transform of H, and that v and J in Eqs. I|A10|I and (|18(l are 
conjugate variables. 



APPENDIX B: RE-FORMULATION OF THE CANONICAL FLUX EXPRESSION FOR TRANSITION 

RATES 

Let us make a change of variable in Eq. H17|) . and replace the integration over average flux J by one over total 
(integrated) flux K = tJ . Then, using p°^{K\ . . .) AK now to represent the normalized probability of flnding a 
total flux K on an equilibrium trajectory of length r, we can write 

Now, the expression p'^{K\a —>■ b) is the probability of accumulating an integrated flux K during interval r, given 
that the initial part At of that interval is taken up with a transition from state a to 6. Since that transition carries 
an integrated flux Kab = Jab At, we can replace the expression by the probability of accumulating the remaining 
flux K — Kab in the remaining time, starting from state &, i.e. 

p';'i{K\a^b)=pl'i^, {K-Kab\ b). 
Hence, after a change of variable, Eq. (jBl|) gives 

ln-§^ = '^Kab+ lim [mb{iy,T ~ At) ~ ma{iy,T)] (B2) 

^a^b 

= i^Kab + lim [mb{iy, t) - ma{i^, r)] - (bii^, At) (B3) 

T — *00 

where 

Cbiiy, At) = lim [mb{i^, t) - rtibiy, t - At)] (B4) 
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and, by substituting r — > r + At into Eq. ljB2p . we find Ca = Cb ^ i-S-, Cft is state- independent. Given that the 
limit in Eq. (|B4p exists, we can write 

Cbii^,At)^Atlmi^(^^^ (B5) 

even for finite At, since m?, asymptotes to a Unear function of t. The state-independence of Cfc can now be used 
to factor out the time-derivate of m from the ensemble average when differentiating Eq. (|19|) with respect to r, 
yielding 

Cbiiy,At) = Q{iy)Atyb. (B6) 
Finally, substituting Eq. (|B6|) into l|B3|l gives a very simple expression for the ratio of transition rates, 



In -i^ = i^Kab - Q{v)At + lim [mb{v, t) - ma{v, r)] (B7) 



from which Eq. 1)24(1 follows. 



APPENDIX C: CALCULATION FOR CONTINUOUS-TIME HOPPING ON A COMB 

For the discrete states of the comb model of section llV Al with the integrated flux J At quantized into discrete 
values of the displacement x, Eq. (|20|l becomes 

oo 

X — — 00 

where G'/3(x,t) is the equilibrium Green function for [3 states. That is the probability of attaining a displacement 
X in time r given that the particle initially occupies a (3 state. An equivalent expression holds for TOq. 

In the continuous-time model, a particle occupying state [3 at time will escape to the corresponding a state at 
a time t that is drawn stochastically from the exponential probability distribution ppa{t) = U°'^ exp{—U'^H) . Once 
excited to the a state, the particle is governed by the corresponding Green function G'q(x,t), so that the Green 
function for a particle occupying state (3 is given by 

G/3(x,t) = / ppa{T - t) Ga{x,t) dt 
JQ 

= f/cqg-cz-r retz-*G„(x,t)di 
Jo 

from which it follows that 
Differentiating with respect to r yields 







In the limit of large r, the time derivative of is just Q, as given by Eq. l|B6p . so that, with the definition of 
in Eq. l(^ . the required result, Eq. follows. 
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